Server Allocation Problem for Multi-Tiered Applications

نویسندگان

  • Kamalika Chaudhuri
  • Anshul Kothari
  • Ram Swaminathan
  • Robert Tarjan
  • Alex Zhang
  • Yunhong Zhou
چکیده

Last few years have seen exponential growth in the area of web applications, especially, e-commerce and web services. One of the most important QoS metric for web applications is the response time for the user. Web application normally has a multi-tier architecture and a request might have to traverse through all the tiers before finishing its processing. Therefore, a request’s total response time is the sum of response time at all the tiers. Since the expected response time at any tier depends upon the number of servers allocated to this tier, many different configurations (number of servers allocated at each tier) can give the same QoS guarantee in terms of total response time. Naturally, one would like to find the configuration, which minimizes the total system cost and satisfies the total response time guarantee. Zhang et al. [15] have modeled this problem as a non-linear integer optimization problem and proposed heuristics to solve it optimally. In this paper we study computational complexity of this non-linear optimization problem, which we call the multi-tier problem. First we show, for the case of variable number of tiers, the decision version of this problem is NP-Complete. Then we present a simple two-approximation algorithm which runs in linear time and a fully polynomial time approximation scheme. For the case of constant number of tiers, we show that the problem is polynomial time solvable.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bi-objective optimization of multi-server intermodal hub-location-allocation problem in congested systems: modeling and solution

A new multi-objective intermodal hub-location-allocation problem is modeled in this paper in which both the origin and the destination hub facilities are modeled as an M/M/m queuing system. The problem is being formulated as a constrained bi-objective optimization model to minimize the total costs as well as minimizing the total system time. A small-size problem is solved on the GAMS software t...

متن کامل

Model-Based System Management for Multi-Tiered Servers

om plem nt Figure 1.3: Structure of this thesis forcement or the delay for obtaining a system state snapshot, both of which may be large. This thesis shows that intuitively correct performance optimization approaches, such as dynamic programming based resource-packing, do not necessarily yield the best cost-optimizing behavior when accounting for resource allocation delays and the fact that res...

متن کامل

Joint Allocation of Computational and Communication Resources to Improve Energy Efficiency in Cellular Networks

Mobile cloud computing (MCC) is a new technology that has been developed to overcome the restrictions of smart mobile devices (e.g. battery, processing power, storage capacity, etc.) to send a part of the program (with complex computing) to the cloud server (CS). In this paper, we study a multi-cell with multi-input and multi-output (MIMO) system in which the cell-interior users request service...

متن کامل

Energy-Efficient Joint Offloading and Wireless Resource Allocation Strategy in Multi-MEC Server Systems

Mobile edge computing (MEC) is an emerging paradigm that mobile devices can offload the computationintensive or latency-critical tasks to the nearby MEC servers, so as to save energy and extend battery life. Unlike the cloud server, MEC server is a small-scale data center deployed at a wireless access point, thus it is highly sensitive to both radio and computing resource. In this paper, we con...

متن کامل

Effectiveness of Dynamic Resource Allocation for Handling Internet Flash Crowds

Internet data centers host multiple Web applications on shared hardware resources. These data centers are typically provisioned to meet the expected peak demands of the hosted applications based on normal time-of-day effects. Such an over-provisioning approach is not robust to flash crowd scenarios, where the load increase of some hosted applications is much higher than their expected peak load...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004